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If you are taking MST121 and MS221 together, then we suggest that you use 
the MS221 Handbook for both courses. 


The Greek alphabet 


A ay alpha N V nu 

B B beta i} € xi 

T Y gamma O Co) omicron 
A ) delta I T pi 

E E epsilon P p rho 

Z € zeta yu ol sigma 
H n eta oT 7 tau 

e) 0 theta Y v upsilon 
I L iota ® o) phi 

K K kappa xX xX chi 

A r lamda wv yw psi 

M m mu Q Ww omega 


SI units 


The International System of units (SI units) is an internationally agreed set of units 
and symbols for measuring physical quantities. 


Some of these are base units, such as 


metre symbol m (measurement of length), 
second symbol s (measurement of time), 
kilogram symbol kg (measurement of mass). 


Prefixes may be added to units. Commonly used prefixes are 


centi or 10? (e.g. centimetre, cm), 
milli or 107° (e.g. millisecond, ms), 
micro or 107° (e.g. microsecond, ps), 
kilo or 10° (e.g. kilogram, kg), 
mega or 10° (e.g. megagram, Mg). 


7 = Be 


There are also derived units, which are used for quantities whose measurement 
combines base units in some way. Some of these are 


area Ta (metres squared or square metres), 
volume m?® (metres cubed or cubic metres), 
velocity nis? (metres per second), 

acceleration ms? (metres per second per second). 
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Notation 


Some of the notation used in the course is listed below. The right-hand column gives 


the chapter 


N 
Z 
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and page of MST121 where the symbol is first used. 


the set of natural numbers: 1,2,3,... 
the set of integers: ...,—3,—2,—1,0,1,2,3,... 


the set of rational numbers, that is, numbers of the form p/q 
where p and q are integers, q # 0 


the set of real numbers 

greater than 

less than 

greater than or equal to 

less than or equal to 

a to the power n 

the non-negative square root of a, where a > 0 
the nth root of a, where a > 0 

1/a”", where a £0 

Va™ (or (%/a)™), where n > 0 and a > 0 


the term of a sequence with subscript n, or the sequence whose 
general term is a, 


infinity 
— oo used to describe the long-term behaviour of sequences; 
the arrow is read as ‘tends to’ 


the Cartesian coordinates of a point, where x is the 
displacement along the horizontal axis and y is the 
displacement along the vertical axis 


approximately equal to 

(sin #)?; a similar notation is used for other powers and other 
trigonometric functions 

a function 

specifies (in terms of x) the rule of the function f 


the interval consisting of the set of all numbers between a and 
b, including a and b themselves 


the interval consisting of the set of all numbers between a and 
b, but not including a and b themselves 


the interval where the endpoint a is included but 6 is not; 
similarly (a, b] includes 6 but not a 

the interval consisting of all numbers greater than a 

‘maps to’ for variables; used to specify the rule of a function, 
for example, x +—> e* 

‘maps to’ for sets; used in functions to show the domain 
mapping to the codomain, for example, [0,4] — [0, 16] 

the modulus (magnitude or absolute value) of the real 
number x 
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exp 
7 

arccos & 
arcsin x 


arctan x 


log, 
In 


n 
doa 
i=l 


lim a,, 


nm—oo 


the base for the natural logarithm function and the 
exponential function; e = 2.718 281... 


the exponential function 

the inverse function of the one-one function f 

the angle in the interval [0,7] whose cosine is x 

the angle in the interval [— 37, $7] whose sine is x 

the angle in the interval (—47, $7) whose tangent is x 
the logarithm function to the base a 


the natural logarithm function, that is, log, where 
e = 2.718281... 


the sum a, + a2 +---+4, 
the limit of the convergent sequence a, 
the infinite sum a; +a 2+--- 


a matrix 

the product of the matrices A and B 

the nth power of the square matrix A 

the sum of the matrices A and B 

the scalar multiple of the matrix A by the real number & 
the difference of the matrices A and B 

the element in the ith row and jth column of the matrix A 
a vector 

the 7th component of the vector v 

the identity matrix 

the inverse of the invertible matrix A 

the determinant of the square matrix A 

the matrix form of a pair of simultaneous linear equations 


the zero vector 


the Cartesian unit vector @ 


the Cartesian unit vector @ 


the magnitude of the vector a 


the displacement vector from P to Q 


the position vector of Q 


a= a,i+d2j the component form of the vector a 


2 Z2Hes 


the magnitude of the acceleration due to gravity 
weight (a vector) 

tension (a vector) 

normal reaction (a vector) 


the (first) derived function of f 
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the (first) derivative of the function f at the point x 
the second derived function of f 


the second derivative of the function f at the point x 


the (first) derivative of y with respect to x (Leibniz notation) 
the second derivative of y with respect to x (Leibniz notation) 


d 
a variation of oo 


dx 


a variation of the Leibniz notation for f’(x) 

the (first) derivative of s with respect to t, where t is time 
(Newton’s notation) 

the second derivative of s with respect to t, where t¢ is time 


(Newton’s notation) 


the indefinite integral of f(a) with respect to x 


the indefinite integral of the function f 
the definite integral of f(x) from a to b 


the definite integral of the function f from a to b 


F(b) — F(a) 

shorthand for the initial condition y = 6 when x = a 

the probability that the event EF occurs 

the probability that a random variable X takes the value j 


the mean of a probability distribution, or a population mean 


the sample mean 
the sample standard deviation 


the standard deviation of a probability distribution, or a 
population standard deviation 


a standard error, that is, the standard deviation of a sampling 
distribution 


an estimated standard error 

the lower quartile 

the upper quartile 

the null hypothesis of a hypothesis test 

the alternative hypothesis of a hypothesis test 


Glossary 


Below is a glossary of terms used in MST121. First the definition of the term is 
given, then the page in this handbook where more detail can be found, and finally 


the chapter and page of MST121 where the term is first used. 
absolute value (of a real number) See modulus (of a real 
number x). 


acceleration The rate of change of velocity. 


acceleration due to gravity The magnitude, g, of the acceleration 
with which an object falls. On Earth, g = 9.8 ms~?. 


alternative hypothesis The hypothesis that is accepted at the end 
of a hypothesis test when the null hypothesis is rejected. 


arbitrary constant See constant of integration. 


arccosine ‘The inverse function of the cosine function with domain 
restricted to [0, 7]. 


arcsine The inverse function of the sine function with domain re- 
; 11 
stricted to [—}7, 57]. 


arctangent The inverse function of the tangent function with domain 
restricted to (—}7, $7). 


arithmetic mean The arithmetic mean (or average) of a set of n 
numbers is the sum of the numbers divided by n. 


arithmetic sequence A sequence in which each term (apart from 
the first) is obtained by adding a fixed number to the previous term. 


asymptote A line which a curve approaches (arbitrarily closely) far 
from the origin. 


batch size The number of values in a batch of data. 


bearing A direction given as either North or South followed by an 
angle (up to 90°) towards the East or West. 


bimodal See mode. 


boxplot A diagram consisting of a box and whiskers, which displays 
the median, the quartiles and the minimum and maximum values in a 
batch of data. 


calculus The branch of mathematics which includes the study of dif- 
ferentiation and integration. 


carrying capacity See equilibrium population level. 


Cartesian coordinate system Cartesian coordinates (x,y) specify 
the position of a point in a plane relative to two perpendicular axes, the 
x-axis (horizontal) and the y-axis (vertical). 


Cartesian unit vectors The vectors i= @ and j = (1): 


chaotic sequence A sequence displaying apparently unstructured 
behaviour. 


chord A line segment joining two points on a curve. 
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circle The set of points in a plane that are a fixed distance from a 
specified point in the plane. 


closed form A formula that defines a sequence a, in terms of the 
subscript n. It should be accompanied by a statement of the appropriate 
range for n. 


codomain A set containing all the outputs of a function. See also 
function. 


coefficient matrix The square matrix used when a pair of simulta- 
neous linear equations are written in matrix form. 


coefficient (of a term) The factor by which the term is multiplied 
in a particular product. 


column (of a matrix) See matriz. 


common difference The difference between any two successive terms 
in an arithmetic sequence. 


common logarithm The logarithm function with base 10. 


common ratio The ratio of any two successive terms in a geometric 
sequence. 

completed-square form The completed-square form of x? + 2pz is 
(a+ p)? — p’. 


component form (of a vector) The description of a vector a in 
terms of the Cartesian unit vectors i and j: a = a,i+ agj. 


component (of a vector) See vector. 


composite function A function k with rule of the form 
k(x) = g(f(2)). 

confidence interval An interval of plausible values for a population 
parameter. 


constant (1) A significant number; for example 7. 
constant (2) A term in a mathematical expression, whose value does 
not change during a particular calculation. 


constant function A function f with rule of the form f(x) =, 
where c is a constant. 


constant of integration The constant c in the indefinite integral 
F(z)+e. 

constant sequence A sequence in which each term has the same 
value. 


continuous function Informally, a function is said to be continuous 
if its graph can be drawn without removing the pen from the paper. 


continuous model A model (or representation) in which the asso- 
ciated quantity or quantities can vary throughout some interval of the 
real line. 


continuous variable A variable that can take any value in an interval 
of the real line. 
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convergent sequence A sequence that settles down in the long term 
to values that are effectively constant. Such a sequence is said to 
converge. See also limit (of a sequence). 


cosecant (of an angle @) The cosecant of 0 is 1/sin 0, provided that 
sin 6 # 0. 


cosine (of an angle @) The first coordinate of the point P on the 
circumference of the unit circle, centre O, where the angle between the 
positive x-axis and the line segment OP is 6. By convention, angles 
are measured positively in an anticlockwise direction from the positive 
x-axis. 


Cosine Rule A rule that relates three sides and one angle of a 
triangle. 


cotangent (of an angle #) The cotangent of 6 is 1/tan0@, provided 
that tan dé 4 0. 


cubic expression An expression of the form az? + bar? + cx + d, 
where a £0. 


cycling (of a sequence) The behaviour of a sequence that takes a 
number of different but repeating values; for example, in a 2-cycle the 
sequence settles to a pattern of alternating between two values. 


decay constant The positive constant & in the first-order differential 
equation dm/dt = —km, used to model radioactive decay. 


decreasing function A real function f with the property that for all 
£1, £2 in the domain of f, if x; < x2, then f(x,) > f (x2). 


decreasing on an interval A function f is decreasing on an interval 
T if, for all x, rq in I, if x, < x2, then f(x,) > f (x2). 

definite integral (of a function f(x) from a to b) The definite 
integral of a continuous function f from a to b, denoted by [ ‘i f(x) dz, is 
[F(«)|° = F(b) — F(a), where F is any integral of f. 

degree of a polynomial For a polynomial in x, the highest power 
of x with a non-zero coefficient. 


dependent variable A variable whose value depends on the value of 
another variable (or variables). 


derivative (of a function f at a point x) The gradient of the 
graph of f at the point (x, f(a)), denoted by f’(x). 
derived function (of a function f) The function f’ defined by the 


process of differentiation. 


determinant (of a matrix) The determinant of the 2 x 2 matrix 


(< 1) is ad — be. 


deviation The deviation of a value x from the population mean ju is 
x — p; the deviation of a value x from the sample mean Z is x — @. 


differential equation An equation that relates an independent vari- 
able, x say, a dependent variable, y say, and one or more derivatives of 
y with respect to x. 


differentiation (of a function f) The process of finding the deriva- 
tive f’(x). 
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direct integration A method for solving differential equations of the 
form dy/dx = f(x). 


direction field The association, arising from a differential equation, 
of a gradient with each point (x,y) in a given domain, often represented 
by a collection of short line segments. 


direction (of a vector) The angle 0, measured anticlockwise, that 
an arrow representing the vector makes with the positive x-direction. 


discrete model A model (or representation) in which the associated 
quantity or quantities can only take separated values. 


discrete variable A variable that can only take on values in a sepa- 
rated set, such as the integers. 


displacement (vector) The displacement vector from P to Q is rep- 
resented by the arrow with its tail at P and its tip at Q. 


distance The distance of a particle from the origin is a measure of 
how far it is from the origin, irrespective of direction. 


domain (of a function) The set of allowable input values for a func- 
tion. See also function. 


doubling time The time that it takes for a population to double in 
size from any starting value. 


element (of a matrix) See matriz. 


equal matrices Two matrices are equal if they are of the same size 
and all their corresponding elements agree. 


Equilibrium Condition for forces The relationship between the 
forces acting on an object at rest. 


equilibrium population level The population size at which the pro- 
portionate growth rate is zero; that is, the size at which the population 
remains constant. It is represented by the parameter EF in the logistic 
recurrence relation. 


estimated standard error A value used to estimate the standard 
deviation of a sampling distribution. 


Euler’s method A numerical method for solving initial-value prob- 
lems of the form dy/dx = f(x,y), y(Xo) = Yo- 


explanatory variable When investigating the relationship between 
two variables, the variable whose values ‘explain’ the values taken by 
the dependent variable. 


explicit solution (of a differential equation) A solution written 
in the form y = F(x), where F is a known function. 


exponential function A function with domain R and rule of the 
form f(a) = a® for some positive real number a. The number a is called 
the base of the exponential function. The most important exponential 
function is f(a) = e*, where e = 2.718281.... The function f(x) = e” 
is also written as exp(x). 


exponential model (continuous) A model based on a first-order 
differential equation of the form dy/dx = Ky, where K is a constant. 
Continuous exponential models can be used to model population varia- 
tion and radioactive decay. 
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exponential model (discrete) A model for population variation, 
based on the assumption of a constant proportionate growth rate, r. 
The model is described by the recurrence relation P,41; = (1+ 1r)Ph, 
where P,, is the population at n years after some chosen starting time. 


finite decimal A number whose decimal representation has a finite 
number of decimal places. 


finite geometric series A sum of the form a+ar+ar?+---+ar”. 
finite sequence A sequence with a finite number of terms. 


first-order differential equation A differential equation that in- 
volves the first derivative, dy/dx say, and no higher derivatives. 


fit value The predicted value of an observation or measurement, 
based on a model fitted to data. It is the value of the dependent vari- 
able which, according to the model, corresponds to a given value of the 
explanatory variable. 


force A push or pull which, if not counteracted, causes the accelera- 
tion of an object. 


force diagram A diagrammatic representation of the forces acting 
on an object. 


frequency diagram A diagram that represents a data set in which 
each value (or group of values) is represented by a rectangle whose 
height is equal to the frequency of that value (or group of values). 


frequency (of a particular value, or of an event) The number 
of times that the value, or event, occurs. 


function A (real) function consists of a subset of R, called the domain, 
and a rule that associates with each x in the domain a unique y in R. 
A function may also be referred to as a mapping or a transformation. 


general solution (of a differential equation) The set of all possi- 
ble solutions of the differential equation, usually involving one or more 
arbitrary constants. 


geometric distribution The distribution of the number of trials 
needed to obtain a success in a sequence of trials of an experiment 
in which the probability of success in each trial is the same. The prob- 
abilities of obtaining the values 1,2,3,... form a geometric sequence. 


geometric form (of a vector) The description of a vector a in terms 
of its magnitude |a| and its direction 0. 


geometric mean The geometric mean of a set of n positive numbers 
is the nth root of the product of the numbers. 


geometric sequence A sequence in which each term (apart from the 
first) is obtained by multiplying the previous term by a fixed number. 


geometric series See finite geometric series and infinite geometric 
series. 


gradient (of a graph at a point) The gradient of the tangent to 
the graph at the point. 


gradient (of a line) See slope (of a line). 
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graph (of a real function f) The set of points (x, f(x)) in the 
Cartesian plane. 


half-life The time that it takes for the mass of a radioactive substance 
to decay to half of its original amount. 


histogram A diagram that represents a data set in which each value 
(or group of values) is represented by a rectangle whose area is propor- 
tional to the frequency of that value (or group of values). 


hypotenuse The longest side of a right-angled triangle. 


i-component (of a vector a) The number a, in the component form 
of the vector a = a,i+ oj. 


identity matrix A square matrix with all the elements on its leading 
diagonal equal to 1 and all the other elements equal to 0. 


image (of x under f) The output of the function f for a given input 
x, that is, the value of f(x). 


image set The complete set of output values of a function. 


implicit differentiation The process of using the Chain Rule to dif- 
ferentiate a function such as z = H(y), where y is a function of x, with 
respect to x. 


implicit solution (of a differential equation) A solution of the 
form H(y) = F(a), where H and F are known functions, and H(y) ¥ y. 


increasing function A real function f with the property that for all 
£1, £2 in the domain of f, if x; < x2, then f(x,) < f(x2). 


increasing on an interval A function f is increasing on an interval 
T if, for all x, rq in I, if x, < x2, then f(x1) < f (a2). 


indefinite integral (of a function f(z)) The expression F(x) + ¢, 
where F(x) is an integral of f(a) and c is an arbitrary constant. 


independent events Two events are independent if the occurrence 
(or not) of one event is not influenced by whether or not the other 
occurs. 


independent variable A variable that can take any value appropri- 
ate to the problem; that is, its value does not depend on the value of 
any other variable. 


infinite decimal A number whose decimal representation has in- 
finitely many decimal places. 


infinite geometric series A sum of the form a+ar-+ar?+-:-:-. 
infinite sequence A sequence that has a first term but no final term. 


initial condition (for a first-order differential equation) A con- 
dition requiring that the dependent variable take a specified value when 
the independent variable has a given value. 


initial-value problem The combination of a first-order differential 
equation and an initial condition. 


integers The positive and negative whole numbers, together with 
zero: ...,—3,—2, —1,0,1,2,3,.... 


integral (of a function f) A function F for which F’ = f. 
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integrand In an integral, the function which is to be integrated. 


integration The process of finding either an integral or the indefinite 
integral of a function. 


intercept A value of x or y where a line (or curve) meets the z-axis 
or y-axis, respectively. The x-intercept is the value of x where it meets 
the x-axis. The y-intercept is the value of y where it meets the y-axis. 


interquartile range The difference between the upper quartile and 
the lower quartile of a batch of data. 


interval An unbroken subset of the real line. 


inverse function The inverse function f~! of a one-one function f 
reverses the effect of f; that is, if f(z) =y, then f~'(y) =a. The 
domain of f~' is the image set of f. 


inverse of a matrix If two square matrices A and B have the prop- 
erty that AB = BA =I, where I is the identity matrix, then each of 
A and B is the inverse of the other. 


invertible matrix A square matrix that has a non-zero determinant, 
and therefore has an inverse. 


irrational number A real number that is not rational, and hence is 
a non-recurring decimal. 


j-component (of a vector a) The number az in the component form 
of the vector a = aji+ ayj. 


leading diagonal (of a matrix) The diagonal of a square matrix 
which starts at the top left and ends at the bottom right of the matrix. 


least squares A method for fitting a line or curve to a set of data 
points in such a way that the sum of the squared residuals is as small 
as possible. 


least squares fit line The straight line fitted using the method of 
least squares. 


left-skew See skewed. 


limit (of a sequence) The value near which a convergent sequence 
settles in the long term. 


limits of integration The numbers a and 6b in the expression 
L f(x) dx. The number a is the lower limit and b is the upper limit. 


linear expression A sum consisting of first powers of the variables 
and a constant. 


linear function A real function with rule of the form f(x) = mz +c, 
for some constants m and c. 


linear recurrence sequence A recurrence sequence with recurrence 
relation of the form x,,,; = rx, +d, where r and d are constants. 


local maximum A stationary point at x = 2 of a function f(x) for 
which f(x) is greater than any other function value in the immediate 
vicinity of xo. 


local minimum A stationary point at x = x of a function f(x) for 
which f(zo) is less than any other function value in the immediate 
vicinity of xo. 
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locus’ A curve defined by a particular property. 


logarithm to the base a_ The inverse function of the exponential 
function f(x) = a*, where a > 0 anda #1. The logarithm function is 
written log, and has domain (0,00). 


logistic model A model for population variation, based on 
the assumption of a proportionate growth rate of the form 
R(P) =r(1 — P/E), where r and F are positive parameters. 


logistic recurrence relation A recurrence relation of the form 
Pri — Ph = rP,(1 — P,,/F), where r and E are positive parameters. 


log-linear plot A plot of the natural logarithm of the dependent 
variable against the independent variable. 


long-term behaviour of asequence The way in which the sequence 
develops as more and more terms are considered. 


lower quartile See quartiles. 
magnitude (of areal number) See modulus (of a real number x). 


magnitude (of a vector a) The length of an arrow representing the 
vector; it is written as |a|. 


main diagonal (of a matrix) See leading diagonal (of a matrix). 


many-one function A function that is not one-one. 


mapping See function. 
mass A measure of the amount of matter that an object contains. 


mathematical model A collection of formulas that attempts to 
quantify how some aspect of the real world behaves. 


matrix A rectangular array of numbers. Each number in a matrix is 
called an element of the matrix. A row of the matrix is a horizontal 
line of numbers in the array, and a column of the matrix is a vertical 
line of numbers in the array. A matrix that has m rows and n columns 
is called an m x n matrix. Matrices of appropriate sizes can be added, 
subtracted and multiplied. 


matrix—vector multiplication The process of multiplying a vector 
by a matrix. 


mean An average of a finite set of numbers. See arithmetic mean, 
geometric mean. 


mean (of a probability distribution, or of a random variable) 
The average value predicted by the probability model. 


median If the values in a batch of data are written in order of in- 
creasing size, then the median is the middle value when the batch size 
is odd, or the average of the middle two values when the batch size is 
even. 


mode _ A peak in a distribution. A distribution that has only one peak 
is called unimodal; a distribution that has two peaks is called bimodal. 


modelling cycle The process of choosing a (mathematical) model, 
trying it out, evaluating it, and possibly changing it. 
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modulus (of a real number x) The magnitude of x, regardless of 
its sign; it is written as |z]. 


natural logarithm The logarithm function with base e, where 
e = 2.718 281...; it is often written as In. 


natural numbers The positive integers: 1,2,3,.... 


network diagram A mathematical representation of a physical net- 
work. Each point at which a network branches is called a node, and 
two nodes of a network may be connected by a pipe. 


newton The SI unit of force. 


node (of a network) See network diagram. 


non-invertible matrix <A square matrix that has zero determinant, 
and therefore has no inverse. 


normal curve The graph of the probability density function of a 
normal distribution. 


normal distribution A particular model for the variation in a con- 
tinuous random variable. 


normal reaction The force acting on an object due to contact with 
a surface, which is directed at right angles to that surface. 


nth root of a The non-negative number which, when raised to the 
power n, gives the answer a. 


null hypothesis A hypothesis about a population (or populations) 
which may or may not be rejected as the result of a hypothesis test. 


one-one function A function f with the property that for all x1, 2» 
in the domain of f, if x; # x2, then f(x.) F f (x2). 


optimisation The process of finding optimum values of a function on 
an interval. 


optimum values (on an interval) The greatest or least values at- 
tained by a function on the interval. 


order (of a differential equation) The order of the highest deriva- 
tive that appears in the differential equation. 


parabola The graph of a quadratic function. 


Parallelogram Rule A rule for the addition of two vectors that are 
in geometric form. 


parameter (1) A variable used when defining a family of mathemat- 
ical objects, such as a recurrence system. 


parameter (2) A variable, often t, used when defining a curve in 
terms of the motion of a point along the curve. 


parametrisation (of a curve) The process of describing the coor- 
dinates of points on the curve in terms of a parameter. 


particle A material object whose size and internal structure may be 
neglected (for modelling purposes). 


particular solution (of a differential equation) A single solution 
of the differential equation, which contains no arbitrary constant. 
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periodic function A function f whose graph is unchanged by a hor- 
izontal translation through the period p; that is, f(a +p) = f(x) for all 
x in the domain. 


perpendicular bisector (of a line segment AB) The line that 
cuts AB halfway along its length and is at right angles to AB. 


pipe (of a network) See network diagram. 


polar coordinates (of a point A) The numbers r and @ for the 
point A with Cartesian coordinates (r cos 0,7 sin 0). 


polynomial A polynomial (of degree n) is an expression of the form 
Ant” + nix" 1 +++++ 4,24 a9, where a, 4 0. 


polynomial function A function whose rule is a polynomial; for ex- 
ample, a quadratic function. 


population The collection of all the individual values or members of 
a specified group of interest. 


population mean The mean of all the values in a population or, 
when a probability distribution is used to model the variation in the 
population, the mean of this probability distribution. 


population parameter A summary measure for a population. The 
population mean yz and the population standard deviation o are exam- 
ples of population parameters. 


population standard deviation The square root of the population 
variance. 


population variance The mean squared deviation of the values in a 
population from the population mean. 


position The position of a particle, with respect to a chosen axis, is 
a measure of how far it is from the origin and of its direction relative 
to the origin. 


position vector A displacement vector whose arrow has its tail at 
the origin. 


n 


power function A function with rule of the form f(a) = x”, where 


n is any real number. 


power (of a square matrix) The nth power of a square matrix A, 
written A”, is obtained by repeated matrix multiplication. 


predicted value See fit value. 


probability density function If a curve is used to model the vari- 
ation in a population, and the total area between the curve and the 
x-axis is 1, then the function that defines the curve is a probability den- 
sity function. According to this model, the proportion of the population 
between two values a and b (a < b) is given by the area under the curve 
between x =a and x =b. 


probability distribution The probability distribution of a random 
variable gives the probabilities of all the possible values that the random 
variable can take. 


probability function The probability function of a discrete random 
variable X is a function that gives, for each value j, the value of the 
probability P(X = j). 
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probability (of an event) The long-run proportion of occasions on 
which the event occurs. 


product matrix The result of multiplying two matrices together. 


proportionate growth rate The proportionate birth rate minus the 
proportionate death rate for a population. 


proportionate growth rate (continuous exponential model) 
The constant K in the differential equation dP/dt = K P, used to model 
population variation. 


quadratic expression An expression in the form az? + br+c, 
where a £0. 


quartic function A polynomial function of degree 4. 


quartiles The median and quartiles of a batch of data divide the 
batch into four roughly equal parts. Roughly 25% of the values in 
the batch are smaller than the lower quartile, and roughly 25% of the 
values are greater than the upper quartile. When the values are written 
in order of increasing size, the lower quartile is the median of the values 
to the left of the median, and the upper quartile is the median of the 
values to the right of the median. 


radius (of a circle) The distance from the centre of a circle to any 
point on the circumference of the circle. 


random variable A quantity that can take different values on differ- 
ent occasions. 


range The range of a batch of data is the difference between the 
maximum and minimum values. 


rational number’ A real number that can be represented as a frac- 
tion, and hence as a recurring decimal. 


real function A function for which both the inputs and the outputs 
are real numbers. 


real line A number line that includes all real numbers. 
real number A number that can be represented as a decimal. 
reciprocal function The function whose rule is f(«) = 1/z. 


recurrence relation A formula that defines each term of a sequence 
by referring to a previous term or terms of the sequence. 


recurrence system ‘The specification of a sequence by an initial term 
or terms, a recurrence relation and a subscript range. 


regression line The regression line of y on x is another name for the 
least squares fit line. See also least squares fit line. 


residual When a line is fitted to a set of data points, the residual of 
each data pair may be calculated using the relationship RESIDUAL = 
DATA — FIT, where DATA is the y-coordinate of the data pair and FIT 
is the y-value predicted by the line for the corresponding x-coordinate. 


resultant The sum of two vectors. 


right-skew See skewed. 
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rise from Ato B_ Therise from a point A(x, y,) toa point B(x, yo) 
iS Yo — Y1- 

root (of an equation) An alternative term for a solution of an 
equation. 


row (of a matrix) See matriz. 


rule (of a function) The process for converting each input value in 
the domain of the function into a unique output value. See function. 


run from Ato B Therun froma point A(x,, y,) toa point B(x, yo) 
is Bg — Li. 


sample A subset of a population. 


sample mean The mean Z of a sample. 


sample standard deviation The sample standard deviation s is the 
square root of the sample variance. 


sample variance The mean squared deviation, from the sample 
mean, of the values in a sample. 


sampling distribution of the mean The distribution of the means 
of all possible samples of size n from a population is called the sampling 
distribution of the mean for samples of size n. 


sampling error An error due to the selection of an unrepresentative 
sample. 


scalar A real number. 


scalar multiplication (of a matrix) The operation of multiplying 
each element of a matrix A by a real number k. The resulting matrix 
kA is called the scalar multiple of A by the real number k. 


scaling (of a graph) Squashing or stretching the graph in the 
x-direction or y-direction. 


secant (of an angle 0) The secant of @ is 1/cos@, provided that 
cos 6 £ 0. 


second derivative (of a function f at a point) The value of the 
second derived function f” at the point. 


second derived function (of a function f) The function f” defined 
by the process of differentiating the derived function f’ of f. 


separable differential equation A differential equation of the form 
dy/dx = f(x)g(y). 


separation of variables A method for solving separable differential 
equations. 


sequence An ordered list (finite or infinite) of numbers, called the 
terms of the sequence. 


series A sum of consecutive terms of a sequence. 
sigma notation A concise way of expressing finite and infinite series. 


significance level The significance level of a test is the probabil- 
ity (often written as a percentage) that the null hypothesis is wrongly 
rejected. 
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sine (of an angle @) The second coordinate of the point P on the 
circumference of the unit circle, centre O, where the angle between the 
positive x-axis and the line segment OP is 6. By convention, angles 
are measured positively in an anticlockwise direction from the positive 
X-axis. 


Sine Rule A rule that relates pairs of sides and the corresponding 
opposite angles of a triangle. 


size (of a matrix) An m x n matrix has size m x n. 


skewed _ A dataset which is not symmetric (or, equivalently, for which 
a frequency diagram is not symmetric) is said to be skewed. If the large 
data values are more spread out than the small data values, so that a 
frequency diagram has a longer right tail than left tail, then the data 
set is right-skew. If the left tail of a frequency diagram is longer than 
the right tail, then the data set is left-skew. The terms right-skew and 
left-skew are also used to describe probability distributions. 


slope (of a line) If A and B are two points on a line, then the slope 
of the line is (rise from A to B) + (run from A to B). The slope is also 
called the gradient. 


smooth A function f is said to be smooth if the derivative exists at 
each point of the domain of f. 


solution of a differential equation A function, y = F(x) say, (or 
a more general equation relating the independent and dependent vari- 
ables) for which the differential equation is satisfied. 


solution (of an equation) A value of the unknown for which the 
equation is satisfied. 


solving atriangle The process of determining all the angles and side 
lengths of a triangle. 


speed A (scalar) measure of how fast an object is moving, irrespective 
of its direction of motion. The speed is the magnitude of the velocity 
vector. 


square matrix A matrix with the same number of rows and 
columns. 


standard error The standard deviation of a sampling distribution. 


standard error of the mean’ The standard deviation of the sam- 
pling distribution of the mean for samples of size n, for any given sample 
size n. 


standard normal distribution The normal distribution with mean 
0 and standard deviation 1. 


stationary point A point x9 in the domain of a smooth function f at 
which f’(x 9) = 0, or the corresponding point (xo, f(ao)) on the graph 
of f. 


step size The value of the variable h in Euler’s method, which deter- 
mines the distance between the successive values of x at which solution 
estimates are calculated. 


subscript In the notation a,, n is called the subscript of a. 
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subtended angle (at a point) The angle that lies between two lines 
drawn from the point to the endpoints of a line segment or an arc of a 
circle. 


in 0 
tangent (of an angle #) tané= a where 6 # +5n,+37,.... 
Cos 


tangent (to a circle) A line that intersects the circle in precisely 
one point. 


tangent (to a graph at a point) The unique line that just ‘touches’ 
the graph at the point; that is, the line that most closely approximates 
the graph near the point. 


tends to infinity (or minus infinity) The terms of a sequence be- 
come arbitrarily large and positive (or arbitrarily large and negative). 


tends to € The terms of a sequence become arbitrarily close to 2. 
tension The force provided by a taut string (rope, etc.). 
term (of a sequence) An item of a sequence. 


test statistic A measure calculated from data, which can be used to 
decide whether or not to reject the null hypothesis. 


transformation See function. 


translation (of a graph) Shifting the graph horizontally or 
vertically. 


triangle of forces A triangle of arrows which is a geometric repre- 
sentation of the Equilibrium Condition for three forces. 


Triangle Rule A rule for the addition of two vectors that are in 
geometric form. 


unbounded sequence A sequence that has terms of arbitrarily large 
value, either positive or negative. 


unimodal See mode. 


unit circle The circle with radius 1 and centre at the origin. 


upper quartile See quartiles. 


variable A symbol used to represent a quantity that can vary. 


vector A matrix consisting of a single column. Each number appear- 
ing in the column is called a component of the vector. 


velocity (vector) The rate of change of position of an object. Ve- 
locity is a measure of how fast the object is moving and its direction of 
motion. 


weight The force on an object due to gravity. 


zero vector The vector in which every component is 0, denoted 
by 0. 
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Background material for MST121 


Rounding numbers 


To round to a given number of decimal places, look at the digit one place to the 
right of the number of places specified. If this digit is 5 or more, then round up; if it 
is less than 5, then round down. 


To round to a given number of significant figures, start counting significant 
figures from the first non-zero digit on the left, and follow the rules for rounding. 


Scientific notation 


In scientific notation, positive numbers are expressed in the form a x 10”, where a is 
between 1 and 10, and n is an integer. 


Rules for powers 
aP x at = aPt4 a? = at = aP-4 (a?)4 = qrP4 ab? = (ab)? 
1 
aq" = q —| qi/n = Ya qm/n = qm = (%/a)™ 
Surds 


An irrational number is in surd form if it is written in terms of roots of rational 
numbers. For example, /2, W5 and </7 + V3 are numbers in surd form. 


Calculating means 


To calculate the mean of a batch of data, add together the values (x) in the batch 
to give }> a, and divide by n, the number of values in the batch. 


Algebra 
Difference of two squares: a? — b? = (a—b)(a+D). 
Squaring a bracket: (a + b)? = a? + 2ab+ 0b. 
The solutions of the quadratic equation ax? + bx + c = 0, where a ¥ 0, are 
—b+ Vv b? — 4dac 
2a ; 
To solve two simultaneous linear equations by substitution. 


— 


1. Rearrange one of the equations so that one unknown is equal to an expression 
involving the other unknown. 


2. Substitute this expression in the other equation. 
Solve the resulting linear equation for the other unknown. 

4. Substitute this solution into either of the original equations to find the remaining 
unknown. 

To solve two simultaneous linear equations by elimination. 


1. Multiply the two equations by numbers chosen so that one of the unknowns has 
the same coefficient, possibly with the opposite sign, in both equations. 


2. Subtract or add the new equations to eliminate that unknown. 
Solve the resulting linear equation for the other unknown. 


4. Substitute this solution into either of the original equations to find the remaining 
unknown. 
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Equivalent rearrangements of inequalities 


If the sides of an inequality are interchanged, then the direction of the inequality 
sign is reversed: > becomes <, > becomes <, and vice versa. 


The same number can be added to (or subtracted from) both sides of an inequality: 
ifa<b, thena+c<b+e. 


Both sides of an inequality can be multiplied (or divided) by the same positive 
number: if a <b and c > 0, then ac < be. 


If both sides of an inequality are multiplied (or divided) by the same negative 
number, then the direction of the inequality sign changes: if a < b andc <0, 
then ac > be. 


Angle measurement 


The angle subtended at the centre of a circle by an arc equal in length to the radius 
of the circle is defined to be one radian. Thus 27 radians = 360°, and the rules for 
converting between degrees and radians are 


180 
x radians = x x —— degrees, y degrees = y Xx —" yadians. 
T 180 
Polygons 


A plane figure which is a closed shape whose sides are straight lines is called a 
polygon. A point where two sides meet is called a vertex. A polygon with n sides 
(and hence n vertices) is referred to as an n-gon. 


The angle sum of an n-gon is (n — 2)180°, that is, (n — 2)z radians. 


An n-gon is said to be regular if all its sides are equal and all its angles are equal. 


Triangles 


A triangle is a polygon with three sides. Its angle sum is 180°, that is, 7 radians. If 
all three sides are of equal length, then it is an equilateral triangle and all three 
angles are 60°. If two sides are of equal length, then it is an isosceles triangle and 
the two angles opposite the equal length sides are equal. 


The area of a triangle is 
© sah, where a is the length of the base and h is the height; 


© jabsin6, where a and b are two side lengths, and @ is the angle between the sides. 


Right-angled triangles 


Pythagoras’ Theorem: For a triangle ABC with side lengths a, b and c (opposite 
A, B and C, respectively), where the angle at C is a right angle, c? = a? + 6?. 


The side opposite the right angle is known as the hypotenuse. 


For this triangle, the trigonometric ratios are 


; opposite a adjacent b opposite a 
sin A = =, cos A = =-, tanA= —— =-—, 
hypotenuse c¢ hypotenuse c adjacent 06 
hypotenuse c hypotenuse adjacent 0 
weued=—” - =-, seccA= = =-, oie SS 
opposite a adjacent opposite a 


é 
b 
It follows from Pythagoras’ Theorem that sin? A + cos? A = 1. 
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Useful trigonometic ratios 


Angle @ in radians 0 a ris 30 $1 

Angle 6 in degrees 0 30 45 60 90 

sin 0 0 4 1/Vv2 V3/2 tL Sine | All — 
positive | positive 

cos @ 1 4/3 /2 If V2 5 0 Tangent | Cosine 
positive | positive 

tan 0 0.1 J3 1 V3 Undefined 


Quadrilaterals 


A quadrilateral is a polygon with four sides. Its angle sum is 360°, that is, 
27 radians. 


© A quadrilateral in which opposite sides are equal is a parallelogram. 
© A quadrilateral in which all sides are equal is a rhombus. 

© A quadrilateral in which all angles are equal (to 90°) is a rectangle. 
© A quadrilateral in which all sides and all angles are equal is a square. 


In a parallelogram, opposite angles are equal and the two diagonals bisect each 
other. In a rhombus, the diagonals bisect each other at an angle of 90°. 


The area of a rectangle is A = 1b, where / is the length and 6 is the breadth. 


Circles 
A circle of radius r has 
© circumference C = 2ar = 7d, where d is the diameter; 


© area A=ar’. 


Congruence 
Two figures are congruent if they have the same shape and the same size. 


Two n-gons are congruent if all corresponding sides and angles are equal. 


Similarity 
Two figures are similar if they have the same shape; their sizes need not be the 


same. 


Two n-gons are similar if each angle in one n-gon is equal to the corresponding angle 
in the other. In this case, the length of each side in one n-gon is the same multiple of 
the corresponding length in the other. 


Prisms 


A prism is a solid with constant cross-section. A cylinder is a prism with circular 
cross-section. 


The surface area of a prism is the sum of the areas of its faces. In particular, the 
surface area of a cylinder is A = 27r? + 27rh, where r is the radius of the circular 
cross-section and h is the length. 


The volume of a prism is the area of its cross-section multiplied by its length. In 
particular, the volume of a cylinder is V = mr?h. 
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Definitions and results in MST121 


The following definitions and results have been collected from the chapters of 
MST121. They are listed in chapter order. If you cannot find the information you 
want here, then try the alphabetical listing in the Glossary. 


MST121 Chapter Al Sequences 


Types of sequences 
Convention: The first term of a sequence has subscript 1, unless otherwise indicated. 


An arithmetic sequence with first term a and common difference d can be 
specified by either of the following recurrence systems: 


© H=6,. tate td (= 12, 3y<i2) 
with closed form z, = a+ (n—1)d (n= 1,2,3,...); 
© CoS Oo Stet C= 01 2,..2), 


with closed form z, = a+nd(n=0,1,2,...). 


A geometric sequence with first term a and common ratio r can be specified by 
either of the following recurrence systems: 


© = c, Wig =—7e, (tH 1,2,5.<..), 


with closed form x, = ar”~' (n= 1,2,3,...); 


© t=O Cage *e, 01,2222), 
with closed form x, = ar” (n= 0,1, 2,...). 


A linear recurrence sequence with parameters a, r and d can be specified by 
either of the following recurrence systems: 


© v= a, Ini =TX, +d a ae ee 
with closed form (when r # 1) 


d ‘i d 
nn a ae ne i a eee 
x (a+—))r ee (n ) 
© =0,. Gea =Te, et (f—0,1,2,<-.), 


with closed form (when r ¥ 1) 


d h d 
o,=(a+—"2)r eT (t= 0,1,2,..058 


Long-term behaviour of sequences 


Range of r_ Behaviour of r” 


r>1 r” — co as n > CO 

r=1 Remains constant: 1,1,1,... 
O0<r<1l r°-Oasn-—c 

r=0 Remains constant: 0,0,0,... 


-l<r<0 r®>0asn—o, alternates in sign 
r=-l Alternates between —1 and +1 
r<-—-l r” is unbounded, alternates in sign 
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Sum of a finite geometric series 


The sum of a finite geometric series with first term a and common ratio r (r ¥ 1) is 


1— n+1 
atartar?+--+ar"=a(—), 
—fr 


MST121 Chapter A2_ Lines and circles 


Lines 
Type Slope Equation 
Parallel to m=0 Y=, 
X-axis where c is a constant 
Parallel to Infinite v= d, 
y-axis where d is a constant 
Not parallel to m= (ye—yi)/(t2-%1), y-—y = m(e — 21) or 
y-axis where (21,41) and (%2,y2) y=mar+c, 
are points on the line where c is the y-intercept 


If two lines are parallel, then they have equal slopes. If two lines are 
perpendicular, then either the product of their slopes is —1 or one has slope 0 and 
the other has infinite slope. 


The distance between two points (x1, y,) and (22, y2) is 


(02 =a Pe — i). 
The midpoint of a line segment with endpoints (2, y,) and (22, ye) is 


(= + 2X2 YW zh) 
2 , 2 , 


Circles 


Geometrically, a circle is the set of points that are at a fixed distance (the radius) 
from a specified point (the centre). Algebraically, a circle with centre (a,b) and 
radius r has the equation 


(w—a)* + (y—b) =r". 


To find the equation of a circle, given three points A, B and C on the circle, find the 
perpendicular bisectors of the line segments AB and BC. The centre of the circle is 
the intersection point of the two perpendicular bisectors. The radius of the circle is 
the distance from the centre to any of the points A, B or C. 


To complete the square of x? + 2px, use 


a? + Ipe = 2? + 2px +p? — p* = (4 +p)? — p’. 


Trigonometry 


Let P(x, y) be a point on the unit circle (with centre O) such that the angle from the 
positive z-axis to OP is 0 (measured anticlockwise if @ is positive, clockwise if 0 is 
negative). Then 


sin 0 


cos? =x, sné=y and tané= (provided that cos 6 # 0). 


Zi 
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Trigonometric identities 


cos (—@) = cos cos (1 — 0) = — cos cos (6 + 27) = cos0 
sin (—0) = —sin@ sin (7 — 0) = sin@ sin (0 + 27) = sin@ 
tan (—0) = —tan@ tan (7 — 0) = —tand tan (@+ 7) = tand 
cos (ia — 0) = sind sin ($7 — 0) = cos cos? 9+ sin? @=1 


Parametrisation of lines and circles 
© A line with slope m passing through the point (x, y,) has parametric equations 
wT=t+r, yo=mt[+y. 


© A line passing through the two points (21, y,) and (x2, y2) has parametric 
equations 


t=a+t(t2—%), y=y+tly—). 
© A circle with centre at (a,b) and radius r has parametric equations 


r=a+rcosé, y=b4+rsind (0<@< 2a). 
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Functions 
A (real) function is specified by giving 
© the domain, that is, the set of allowable input values, which are real numbers; 


© the rule for converting each input value to a unique output value, which is also 
a real number. 


The output of a function f for a given input x is called the image of x under f, and 
is written f(a). The set of all outputs of the function f is called the image set of f. 


Convention: When a function is specified just by a rule, it is understood that the 
domain of the function is the largest possible set of real numbers for which the rule is 
applicable. 


Function notation 

A standard notation used to specify a function f is 
f(zj)=27+1 (0<2<6). 

Other notations used to specify the same function f are 

© f:area?+1 (0< 2 <6); 

© f:[0,6] -R 


areow +i. 


Modulus of a real number 


The modulus, or absolute value, of a real number z is the magnitude of 2, 
regardless of sign, denoted by |z|. Thus 


a= x, ifx>0, 
a —x, ifx<0. 
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Translating and scaling a known graph 


Graph Translation or scaling of y = f(x) 


y= f(x+p) Horizontal translation by p units to the left (right if p is negative) 
y = f(x)+q _ Vertical translation by g units upwards (downwards if q is negative) 
y=af(z) y-scaling with factor a 

y= fbr) x-scaling with factor 1/b 


The scalings and translations in the table above can, with two exceptions, be applied 
in any order with the same result. The exceptions are that the result of applying 
both a horizontal translation and an x-scaling depends in general on the order in 
which these are applied, and similarly for both a vertical translation and a y-scaling. 


Graphing quadratic functions 


To sketch the graph of a quadratic function f(r) = ax? + bx +c, first write the 
function in completed-square form: f(x) = a(x +p)? +q. Then start with the graph 
of y = x? and perform 

1. a y-scaling with factor a; 

2. a horizontal translation by p units to the left (right if p is negative); 

3. a vertical translation by q units upwards (downwards if q is negative). 

The graph is a parabola with vertex (—p,q), which is the lowest point if a > 0 and 
the highest point if a < 0. Its axis of symmetry is x = —p. 

The y-intercept is found by setting x = 0 to give f(0) =c. The z-intercepts (if any) 
are found by setting y = 0 and solving ax? + br + c= 0. 


Inverse functions 

A real function f is one-one if it has the following property: for all x,, #2 in the 
domain of f, if x; A ro, then f(x1) F f(zx2). 

A real function f is increasing if it has the following property: for all #1, in the 
domain of f, if 7; < %, then f(x) < f(x). 

A real function f is decreasing if it has the following property: for all 7), 22 in the 
domain of f, if x; < X2, then f(x,) > f (x2). 


If a function is increasing (or decreasing), then it is one-one. 


When a function f is one-one, an inverse function f~! can be defined which 
reverses the action of f. 


© To obtain the rule for the function f~' (in terms of x), solve the equation 
y = f(x) to obtain x in terms of y, and then exchange the roles of x and y. The 
image set of f is the domain of f~', and vice versa. 


© To obtain the graph of y = f~'(x), reflect the graph of y = f(x) in the 45° line. 
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Inverse trigonometric functions 
The function f(x) = sina (—}7 < a < $7) has an inverse function arcsine with 
domain [—1,1] and image set [-—57, 57. Thus, for -l<y <1, 

x=arcsiny means that y=sina and —jm<a2< $a. 
The function f(x) = cosa (0 < a < 7) has an inverse function arccosine with 
domain {[—1, 1] and image set [0,2]. Thus, for —1 < y <1, 

x=arccosy meansthat y=cosxand0<a2<7. 
The function f(a) = tana (—}m < x < 47) has an inverse function arctangent with 


domain (—oo, oo) and image set (—}7, $7). Thus, for —co < y < oo, 


x=arctany means that y=tanz and — $1 aK ST. 


Logarithms 


An exponential function f(x) = a”, where a > 0 and a 4 1, has domain R and image 
set (0,00). Its inverse function, called logarithm to the base a and denoted by log,, 
has domain (0,00) and image set R. Thus, for y > 0, 


x=log,y meansthat y=a’. 


Combining these results gives 


x=log,(a”), forvinR, and y=a'®¥, for y>0. 


The natural logarithm has base e = 2.718281... and is often written as In. The 
common logarithm has base 10 and is often written as log. 


Provided that a > 0 and a 4 1, the logarithm to the base a has the following 
properties: 
(a) log, 1=0, leg, a= 1; 
(b) for > 0 and y > 0, 

(i) log, (xy) = log, x + log, y, 

(ii) log, (x/y) = log, x — log, y; 
(c) forge >OandpinR,  log,(x”) = plog, z. 
To use logarithms to solve an equation of the form a® = k, where k > 0 and 
a>0,a#1, apply the function In to both sides of the equation, and use 
property (c) to obtain 

_ Ink 


ae 
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MST121 Chapter B1 Modelling with sequences 


Formulas for sums 


Soo Scr teats +e 


i=1 


n 


: t—rrti 
Yar =atar+ar?+---+ar"=a(——"—) (¢ #1) 
i=0 =f 
eh 2d a 
a _ 1 
22 i (|r| < 1) 


n n 


Sete put eee a 


t=1 4=1 w=m 1=™M 


S0i=142434---4+n=in(n+1) 


i=1 


Exponential model 


The (discrete) exponential model for population variation is based on the assumption 
of a constant proportionate growth rate, r. The model is described by either the 
recurrence relation 

Pe =OUstnP, (e012 iss), 
or its closed-form solution 

P= +r. (8 =O, 1525 2..); 


where P,, is the population size at n years after some chosen starting time. The 
proportionate growth rate r is the proportionate birth rate minus the proportionate 
death rate. 


Logistic model 


The logistic model for population variation is based on the assumption of a 
proportionate growth rate R(P) of the form R(P) = r(1— P/E), where r and E are 
positive parameters. The model is described by the recurrence relation 


Pr 
Puy — Py =P (1-4) (= 01,25 2:2)5 


where P,, is the population size at n years after some chosen starting time. The 
positive constant r represents the proportionate growth rate of the population when 
the population size is small, and the positive constant F represents the equilibrium 
population level (the population size at which the proportionate growth rate is zero). 


The long-term behaviour of sequences generated by the logistic recurrence relation 
(with 0 < Py < E(1+1/r)) depends on the value of r, as shown in the table below. 


Range of r Long-term behaviour of P,, 

O<r<l Settles close to (converges to) FE, with values always just below F 

Lr <2 Settles close to E, with values alternating between just above and just below E 
2<r<2.44 2-cycle, with one value above F and one value below E 

2.45 <r <2.54 4-cycle, with two values above F and two values below E 

2.6<r<3 Chaotic variation between bounds (with some exceptions) 
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Convergence and limits 


If a sequence P,, settles down in the long term to values that are effectively constant, 
then P,, is said to be convergent. The value near which P,, settles in the long term 
is called the limit of the sequence, written 


lim P,,. 


n—oco 


If a sequence P,, given by a recurrence relation converges, then it converges to the 
value of a constant sequence that satisfies the recurrence relation. To find such 
values, substitute P,, = c and P,,,,; = c into the recurrence relation and solve for c. 


Reciprocal Rule 


If the terms of a sequence b,, are of the form 1/a,, where terms of the sequence a,, 
become arbitrarily large as n increases, then lim 6b, = 0. 


Constant Multiple Rule 


If the terms of a sequence 6, are of the form ca,, where lim a, = 0, and cis a 


constant, then lim 6, = 0. 


Long-term ‘basic sequence’ behaviour 
The long-term behaviour of the sequence r” (n = 1,2,3,...) is as follows. 


© If |r| > 1, then |r”| > co asn — of. (If r > 1, then r” > oo as n — on; if 
r <—1, then r” is unbounded and alternates in sign.) 


© If |r| <1, then r” > 0 as n > ov. 

© Ifr=1, then r” =1. Ifr=-—1, then r” alternates between 1 and —1. 
The long-term behaviour of the sequence n? (n = 1,2,3,...) is as follows. 
© Ifp>0, then n? > co as n— oo. 

© Ifp<0, then n? —0asn—- oo. 

© Ifp=0, then n? = 1. 
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MST121 Chapter B2_ Modelling with matrices 


Networks and matrices 


A physical network can be represented by a network diagram and also by a 
matrix. The entries of the matrix indicate the proportion of the input flowing 
through each pipe of the network. 


The network diagram and the matrix shown below represent the same physical 
network having two input nodes A and B, and three output nodes U, V and W. 


0.4 0.3 
0.2 0.1 
0.4 0.6 


If the outputs of one network feed directly into an equal number of inputs in a 
second network, then the matrix representing the combined network is obtained by 
multiplying the matrices representing the two original networks. 


Arithmetic of matrices and vectors 


A matrix is a rectangular array of numbers. Each number in a matrix is called an 
element. A matrix with m rows and n columns is called an m x n matrix, and has 
size m x n. The element in row 7 and column j of a matrix A is written as a;;. 


Matrix multiplication 


Two matrices A and B can be multiplied only if the number of columns of A equals 
the number of rows of B. The element in the ith row and jth column of the product 
matrix AB is obtained by adding up the products of corresponding elements of the 
ith row of A and the jth column of B. 


Thus, if A is an m x n matrix and B is an n x p matrix, then C = AB is an m x p 
matrix with elements given by 


=> goby UH loaned 7 1,9, .1069). 
k=1 


Qi, Q42 b b h 
For example, if A = | a2; dg. |} and B= ( eee = then 
bo: be2 beg 


Q31 432 


21611 + A22b91  A21b12 + Aa2b22  G21b13 + A22b23 
a31011 + @32b21  @3ibi2 + G32b22  a31b13 + @32b23 


441611 + Ay2b21 Ay b12 + Ay9b22 11613 + ai2b23 
C=AB= 


In most cases, AB #4 BA. The power A” of a square matrix A is formed by 
multiplying together n matrices A; for example, A? = AAA. 
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Matrix addition 


Two matrices A and B can be added only if they have the same size. If A and B are 
m xX n matrices, then C = A + B is also an m Xx n matrix with elements given by 


Cig = O43 + 5,; (= 1,2,05:,9n and 4 =1,2,.05.5%8): 


For example, if A = i — and B= t ba a) then 


Q21 422 423 boi ba2 bag 


a+b, a+b aig + ae 


C=A+Be= 
* Gass 22 + bo2 23 + bo3 


Scalar multiplication 


When a matrix is scalar multiplied by a real number k, each element of the matrix is 
multiplied by k. For example, if 


A= a a12 4 , then kA = Gs kaye ae ; 


Q21 422 423 kag, kag. kag 


The matrix kA is a scalar multiple of the matrix A. 


General properties of matrices 
For any matrices A, B and C of appropriate size, and any real number k: 


A+B=B+A (A+B)+C=A+(B+C) 
(AB)C = A(BC) A(kB) = (kA)B = k(AB) 
AB+AC=A(B+C) 


Vectors 


A vector is a matrix with only one column. Elements of a vector v are often called 
components and are specified as v;. The size of a vector is the number of 
components it has. 


For vectors u and v, the vectors u+ v and ku are formed according to the 
definitions for general matrices. 


Population modelling 


A matrix model for the structure of a population in terms of two interdependent 
subpopulations J,, and A,, is given by 


Pr+i = Mp,, (e=0,1,2,44), 


where M is a 2 x 2 matrix and p,, is the vector ) which gives the 
subpopulation sizes at n years after a chosen starting time. 
The closed-form solution for this model is 


Dp, =M"p,. (= 1,2, 35050): 
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Inverting 2 x 2 matrices 
1 0 
0 1 
AI=JA=A. 


If two 2 x 2 matrices A and B have the property that AB = I = BA, then B is the 
inverse of A. The inverse of a matrix A is usually denoted A~!. The inverse of the 


The matrix T= ( ) is the 2 x 2 identity matrix. For any 2 x 2 matrix A, 


general 2 x 2 matrix ¢ i) is given by 


: ( : a , provided ad — bc 40. 


ad—be\—-ce a 


The determinant of a square matrix is a number calculated from its elements. For 


a2x2 matrix A= (4 p 
c ad 


), the determinant is given by det A = ad — bc. 
Determinant test for invertibility 


If the determinant of a matrix A is not zero, then A has an inverse and A is 
invertible. 


If the determinant of a matrix A is zero, then A does not have an inverse and A is 
non-invertible. 


Solving a pair of simultaneous linear equations using matrices 


Write the simultaneous linear equations in matrix form Ax = b, where A is the 
coefficient matrix, x is the vector of variables and b is the vector with components 
equal to the right-hand sides of the equations. 


If the matrix A is invertible, then the solution is given by x = A~'b. 
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MST121 Chapter B3 Modelling with vectors 


Vectors can be represented: by 2 x 1 matrices (column form), by arrows in the 
(x, y)-plane (geometric form), or by using the Cartesian unit vectors i and j 
(component form). 


Arithmetic of vectors in column form 


The sum of two vectors with the same number of components is formed by adding 
the corresponding components. 

The scalar multiple of a vector a by a real number (scalar) k, denoted by ka, is 
formed by multiplying each component of a by k. 


Arithmetic of vectors in geometric form 


Triangle Rule 


To find the sum a+ b of two vectors a and b in geometric form. 

1. Choose any point P in the plane. 

2. Draw an arrow to represent a, with tail at P and tip at Q, say. 
3. Draw an arrow to represent b, with tail at Q and tip at R, say. 
4 


Draw the arrow with tail at P and tip at R, to complete the triangle PQR. This 
last arrow represents the vector a+ b. 


Parallelogram Rule 


To find the sum a+ b of two vectors a and b in geometric form. 
1. Choose any point P in the plane. 

2. Draw an arrow to represent a, with tail at P and tip at Q, say. 
3. Draw an arrow to represent b, with tail at P and tip at S, say. 
4 


Complete the parallelogram PQRS, and draw the arrow with tail at P and tip at 
R. This last arrow represents the vector a+ b. 


Scalar multiplication 


If a is a vector in geometric form and k is a real number, then the scalar multiple ka 
has magnitude |ka| = |k\|a|. If k is non-zero, then the direction of ka is the same as 
that of a if k > 0, or opposite to that of aif k < 0. 


Arithmetic of vectors in component form 


The sum of two vectors in component form, a = a,i+ a2j and b = b,i+ bgj, is given 
by a+ b= (ay + b,)i + (az + by)j. 


The scalar multiple of a vector in component form, a = a,i+ aaj, by a real 
number k, is given by ka = ka,i+ kagj. 


Converting vectors from geometric form to component form 
A vector a in geometric form, with direction 8, has component form 
a = |a|cos@i+ |a|sin 0 j; 


that is, the i-ccomponent of a is a; = |a|cos@ and the j-component of a is 
a2 = |alsin 6. 
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Converting vectors from component form to geometric form 


A vector in component form, a = a,i+ aj, has magnitude |a| = \/aj + a3. If the 
vector is non-zero, then its direction, in terms of the angle 6 measured anticlockwise 
from the positive x-axis, is obtained from the figures below. 


@ = 180°— @ 


@ = -(180°- 4) 


(a) (b) 


(a) a is parallel to a coordinate axis (b) ¢ = arctan(|a2/a,]) 


Sine and Cosine Rules 


By convention, in a triangle, the vertex labels A, B and C are also used to denote 
the corresponding angle sizes, while the side lengths opposite the angles are denoted 
by a, b and c, respectively. 


In a triangle ABC, if a < b then A < B, and vice versa. 


Sine Rule 


For any triangle, the side lengths a, b, c and corresponding opposite angles A, B, C 
are related by the formulas 


snA sinB sind ‘olen a b Cc 
= = or, equivalently, - = =7 . 
a b q y snA sinB sinC 


Cosine Rule 
For any triangle, the side lengths a, b, c and corresponding opposite angles A, B, C 
are related by the formulas 

b* + ce? — a? 


a? = b? + c? — 2bccos A, cos A = ————__., 
2bc 


+a’? —b 
b? = c? + a? — 2cacos B, ea 
2ca 


247-2 
2 = a’? +b? — 2abcosC, Ce ae 
2ab 
Equilibrium Condition for forces 


If an object is acted upon by n forces, F,,F2,...,F,, and remains at rest in the 
absence of other forces, then the force vectors satisfy the equation 


SUF = Fit Fot-:++Fr=0. 


i=l 


ou 


MST121 Cl 


MST121 Chapter C1 Differentiation and modelling 


Differentiation 


Differentiation is a process which enables you to find: the gradient of a graph, and 
the rate at which one variable changes with respect to another. 


Let f be a function. 


© The derivative f’(x) at a point x in the domain of f is the gradient of the 
graph of f at (x, f(x)), given by 
a ate (LOT HTe) 
f(z) = tim (A , 


provided that this limit exists. 


© Ify= f(x), then f’(x) is the rate of change of y with respect to x. 
A function is smooth if its derivative exists at each point of its domain. 


In Leibniz notation, f’(x) is *(F(0)), or “ where y = f(x). 


ds : 
In Newton’s notation, a § (used only when differentiating with respect to time). 


Table of standard derivatives 


Function f(z) Derivative f’(x) 
c 0 
ie ng! 
sin(ax) acos(azx) 
cos(az) —asin(axr) 
eo* ae“ 
In(az) (ax > 0) lje (ax>D) 


Sum and Constant Multiple Rules 


If k is a function with rule of the form k(x) = f(x) + g(x), where f and g are smooth 
functions, then k is smooth and 


k(x) = f'(x) + 9'(2). 


If k is a function with rule of the form k(x) = cf(x), where f is a smooth function 
and c is a constant, then k is smooth and 


ka) = ef (a). 


Product Rule 


If k is a function with rule of the form k(x) = f(x)g(x), where f and g are smooth 
functions, then k is smooth and 


ki(a) = f'(@)g(@) + fla)g'(@). 
In Leibniz notation, if y = uv, where u = f(x) and v = g(a), then 
dy d du 


ae 
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Quotient Rule 


If k is a function with rule of the form k(x) = f(x)/g(x), where f and g are smooth 
functions, then k is smooth and 


1 9l)f"(2) — fa)g'(2) 
aie Gare 


In Leibniz notation, if y = u/v, where u = f(x) and v = g(x) £0, then 
dy 1 ( du =) 


dx v2 ae nae 


Composite Rule 
If k is a function with rule of the form k(x) = g(f(x)), where f and g are smooth 
functions, then k is smooth and 
k(x) = g'(f(x)) f'(@). 
In Leibniz notation (Chain Rule), if y = g(u), where u = f(x), then 
dy _ dy du 
dx dudx 


Increasing/Decreasing Criterion 

Let J be an open interval in the domain of a smooth function f. 
© If f’(x) > 0 for all x in J, then f is increasing on I. 

© If f’(x) <0 for all x in J, then f is decreasing on I. 


Stationary points 


Let f be a smooth function. The function f has a stationary point at x = 29 if 
f'(ao) = 0. The corresponding point (xo, f(xo)) on the graph of f is also called a 
stationary point. 


First Derivative Test 

To identify the local maxima and local minima of a smooth function f using the 
First Derivative Test proceed as follows. 

1. Find the stationary points of f. 

2. For each stationary point 29, 

(a) choose points x, to the left of zp and xp to the right of x9, such that the 
interval [xL, 2p] lies in the domain of f and there are no stationary points 
between x, and 2, nor between zo and xp, and then calculate f’(r,) and 
(aa) 

(b) classify xo as follows: 
© if f(a) > 0 and f’(rR) < 0, then f has a local maximum at 2; 
© if f’(a_) <0 and f’(xp) > 0, then f has a local minimum at 2; 


© if f’(x_) and f’(apR) have the same sign, then f has neither a local 
maximum nor a local minimum at 2p. 
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Second Derivative Test 
Suppose that 29 is a stationary point of a smooth function f; that is, f’(xo) = 0. 
© If f’(ao) < 0, then f has a local maximum at 29. 


© If f’(xo) > 0, then f has a local minimum at 2. 


Optimisation Procedure 


To find the greatest (or least) value of a smooth function f on a closed interval I 
within the domain of f, proceed as follows. 


1. Find the stationary points of f. 


2. Evaluate f at each of the endpoints of J and at each of the stationary points 
inside J. 
3. Choose the greatest (or least) of the function values found in Step 2. 


MST121 Chapter C2 Integration and modelling 


Integration 


The function F' is an integral of the function f if F’ = f. The indefinite integral 
of f(a) is 


[f@dr= FO) +6, 


where F is an integral of f and c is an arbitrary constant, also called the constant 
of integration. 


The definite integral of a continuous function f from a to b, denoted by 


[ f0o)ae, 
is defined to be 
[F(x)], = F(b) -— F(a), 


where F is any integral of f. 


Table of indefinite integrals 


Function f(z) Integral [fo dx 
a (constant) ax+c 
1 
n = n+1 
xe (n 4 —-1) ar had +c¢ 
1 
— (x>0) Inz+c 
a 
1 
ett ett +¢ 
a 
D. 
cos(az) —sin(ar) +¢ 
a 
1 
sin(az) ——cos(axz) + ¢ 
a 
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Rules for integration 


The Sum Rule for integrals is 


[U@+9@)ae= f faydet f oo) ae. 


The Constant Multiple Rule for integrals is 


[ofo) = bf Fa) ae 


The combined Sum and Constant Multiple Rule is 


[(af(@) + g(a) de = a | F(x) de +b f g(a) dz. 


Double-angle formulas 
sin(20) = 2sin @cos 0 


cos(20) = cos? @ — sin? 6 = 2cos?9— 1=1—2sin’6 


Two integration formulas 


[U@y sf @) ae = wy +e (nA 1) 


F(z) = Il iL Cc L 
[Foy = Ute (F@)>0) 


Modelling motion 

The SI units for kinematic quantities are as follows. 

© Time is measured in seconds (s). 

© Position is measured in metres (m). 

© Velocity is measured in metres per second (ms~'). 

© Acceleration is measured in metres per second per second (m s~’). 

Time ft, position s, velocity v and acceleration a are related by the equations 
ds du 


The following formulas apply for the motion of a particle along a straight line with 
constant acceleration a, if at time t = 0 the particle has velocity vg and position 59. 


© The velocity v of the particle is given by 
v=at+ Vo. 
© The position s of the particle is given by 
s= zat? + Vot + So. 
© The velocity and position of the particle are related by the equation 


v’ — 2as = v4 — 2asp. 
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Finding the area under a graph 


If f(x) is a continuous function that takes no negative values for a < x < b, then the 
area of the region bounded by the graph of y = f(x), the x-axis, and the lines x = a 
and x = J, is equal to the definite integral 


| ” Fe) ae. 


Fundamental Theorem of Calculus 


If f is a function which is continuous on the interval [a, b], then 


N-1 


[ f(a) de = jim bs hf (a + th) 


b-—a 


here h = 
, where v 


MST121 Chapter C3 Differential equations and modelling 


Differential equations 


A differential equation is an equation that relates an independent variable, x say, 
a dependent variable, y say, and one or more derivatives of y with respect to x. The 
order of a differential equation is the order of the highest derivative that appears in 
the equation. A first-order differential equation involves the first derivative, dy/dz, 
and no higher derivatives. 


A solution of a differential equation is a function y = F(x) (or a more general 
equation relating x and y) for which the differential equation is satisfied. The 
general solution of a differential equation is the set of all possible solutions of the 
equation. It usually involves one or more arbitrary constants. A particular 
solution of a differential equation is a single solution of the equation, which consists 
of a relationship between the dependent and independent variables that contains no 
arbitrary constant. 


An initial condition associated with a first-order differential equation requires that 
the dependent variable y takes a specified value, b say, when the independent 
variable x has a given value, a say. This is often written as 


y=bwhenz=a, oras y(a)=b. 


The numbers a and 0 are called initial values for x and y, respectively. The 
combination of a first-order differential equation and an initial condition is called an 
initial-value problem. The solution of an initial-value problem is a particular 
solution of the differential equation which also satisfies the initial condition. 


Direct integration 


The general solution of the differential equation dy/dx = f(x) is the indefinite 
integral 


y= f f(z)de = Fla) +0, 


where F(z) is any integral of f(x) and c is an arbitrary constant. Any initial 
condition 


y=bwhenz=a, thatis, y(a)=6), 
enables a value for the arbitrary constant c to be found. The corresponding 


particular solution satisfies both the differential equation and the initial condition. 
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Implicit differentiation 


If y is a function of and H(y) = F(x), then H’(y)— = F’(z). 


Separation of variables 
The method applies to differential equations of the form dy/dx = f(x)g(y). 
1. Divide both sides by g(y), for g(y) 4 0, to obtain 
1 dy 
—_— = f(x). 
gly) dx " 


2. Integrate both sides with respect to x. The outcome is 


[aun [tox 


3. Carry out the two integrations, introducing one arbitrary constant, to obtain the 
general solution in implicit form. If possible, manipulate the resulting equation 
to make y the subject, thus expressing the general solution in explicit form. 


Modelling growth and decay 


The differential equation dy/dx = Ky, where K is a constant, has the general 
solution y = Ae**, where A is an arbitrary constant. 


The process of radioactive decay, that is, the change in mass m of a radioactive 
substance that is present at time t, can be modelled by the initial-value problem 
dm 
dt. 
Here k is a positive constant, called the decay constant, and mp is the initial mass 
of the substance. This initial-value problem has the solution 


=-km (m>0), m=mp whent=0. 


m= mee. 

The half-life T of a radioactive substance is the time it takes for the mass of 
radioactive substance to diminish to half its original amount. In the model, we have 
T = (In2)/k. The value of the decay constant k can be estimated from data by 
plotting In(m/m,) against t. This should approximate a line through the origin with 
gradient —k. 


The process of population change, that is, the change in the population size P over 
time t, can be modelled by the initial-value problem 


dP 
an KP (P>0), P= Py) whent=0. 


Here K is a constant, called the proportionate growth rate, and P is the initial 
population size. This initial-value problem has the solution 


P= Pye**. 


If K <0, then the population is decreasing and the half-life of the population can be 
defined as for radioactive decay. If kK > 0, then the population is increasing and the 
doubling time T of the population is the time it takes for the population to double 
in size. In the model, we have T = (In2)/K. The value of the proportionate growth 
rate K can be estimated from data using a log-linear plot of In P against t. This 
should approximate a line which crosses the (In P)-axis at In Py and has gradient K. 


MST121 C3 


43 


MST121 C3 


Euler’s method 


Euler’s method for solving the initial-value problem 


d 
4 = f(2,y), y(z0) = yo 


is described by the pair of recurrence relations 
Goi =e bh, Ves =n PR esta) 4e=0,1,2,44.), 


where h is the step size between the successive values of x at which solution 
estimates are calculated. Each calculated value y, is an estimate of the 
corresponding ‘true solution’ y at x = ,; that is, y, is an estimate of y(z,,). The 
sequence of estimates depends on the choice of both the step size h and the overall 
number of steps NV. Decreasing h, while increasing N to cover the same range of 
z-values, leads to progressively improved estimates for the solution values, and with 
a small enough step size, any desired level of accuracy can be achieved. 


44 


MST121 D1 


MST121 Chapter D1 Chance 


Probability 

For any event E, 0 < P(E) <1. 

If an event E never happens, then P(E) = 0. 

If an event E is certain to happen, then P(£) = 1. 


If an experiment has N equally-likely possible outcomes, and n(£) is the number of 
these outcomes that give rise to an event E, then 
_ (Z), 


that is, P(E) is equal to the number of outcomes for which the event E occurs 
divided by the total number of possible outcomes. 
If FE is an event and not-E is the opposite event (that FE does not occur), then 
P(E) + P(not-F) = 1, 
or, equivalently, 
P(E) =1- P(not-E). 


Two events are independent of each other if the occurrence (or not) of one is not 
influenced by whether or not the other occurs. 


The multiplication rule for independent events states that if FE and F are 
independent events, then 


P(E and F) = P(E) x P(F). 


Geometric distributions 


If a sequence of trials of an experiment is carried out and the probability of success 
in each trial is p (0 < p < 1) independently of the results of earlier trials, then X, the 
number of trials required to obtain a success, has a geometric distribution. The 
probability function of X is given by 


PX =jjH= (leap) p,, 9 = 1, 28. is: 


The mean number of trials required to obtain a success is 1/p. 


Probability distributions 


The mean of the probability distribution of a discrete random variable X is denoted 
by p and is defined to be 


w= S1Ix P(X =)), 
Fi 
where the summation is over all values 7 which X can take, that is, for which 
PLL =F) > 0. 


The corresponding formula for the mean of a continuous random variable X with 
probability density function f is 


p= 2f@yar. 
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MST121 Chapter D2 Modelling variation 


The variance of a random variable X or of a probability distribution is the mean of 
the values (a — 44)*, where the mean is taken over all values x that X can take. The 
standard deviation of a random variable or of a probability distribution is the 
square root of the variance. 


When a probability distribution is used to model the variation in a population, the 
mean of the distribution is called the population mean, and the standard deviation 
is called the population standard deviation. The population mean and the 
population standard deviation are examples of population parameters. 


Sample statistics and population parameters 
For a sample of n observations 11, %2,...,%,, the sample mean 7 is given by 


7 il 12 
B= —(2, +a2+++ +a) =—) as, 


The sample mean and sample standard deviation are examples of sample 
statistics. In general, given a sample of data from a population, the sample mean 7 
is used to estimate the population mean jz, and the sample standard deviation s is 
used to estimate the population standard deviation o. 


Normal distributions 


A normal distribution is a continuous probability distribution. Probabilities are 
calculated by finding areas under a normal curve, which has the following typical 
shape. 


f(x) 4 
) 
| 
7 " 


The probability density function of a normal distribution is the function f, where 

y = f(a) is the equation of the normal curve. It is defined for all real values of x. 
The equation of a normal curve contains two parameters, j and a: yu is the mean of 
the distribution, and a is its standard deviation. The curve is symmetric about its 
peak, which occurs at x = yw. The normal distribution with mean 0 and standard 
deviation 1 is called the standard normal distribution. 


If a normal distribution is used to model the variation in a population, then, 
according to the model, the proportion of the population within k standard 
deviations of the mean is the same whatever the values of the mean yu and the 
standard deviation o. In particular, approximately 95% of the population are within 
1.96 standard deviations of the mean (that is, between 4 — 1.960 and w+ 1.960). 
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Sampling distributions, the Central Limit Theorem and confidence intervals 


The sampling distribution of the mean for samples of size n from a population with 
mean p and standard deviation o has mean yz and standard deviation a/./n. These 
results hold for any sample size. 


The standard deviation of the sampling distribution of the mean is called the 
standard error of the mean and is denoted by SE. 


The Central Limit Theorem states that, for large sample sizes (at least 25), the 
sampling distribution of the mean for samples of size n from a population with mean 
p and standard deviation o may be approximated by a normal distribution with 
mean p and standard deviation 


ol 
SE = —. 
Jn 
Given a sample of size n from a population, a 95% confidence interval for the 
population mean p is given by 


s 8 
z — 1.96 —, 7% + 1.96 —— 
(z gen =) ; 


where % is the sample mean and s is the sample standard deviation. The sample size 
n must be at least 25. 


MST121 Chapter D4 ‘Further investigations 


Summary statistics and boxplots 


The median is essentially the middle value of a batch of data when the values are 
placed in order of increasing size. If the batch size is odd, then the median is the 
middle value. If the batch size is even, then the median is the mean of the middle 
two values. 


The lower quartile, which is denoted by Q1, is the median of the values to the left 
of the median. 


The upper quartile, which is denoted by Q38, is the median of the values to the 
right of the median. 


The interquartile range is the difference between the upper quartile and the lower 
quartile. 


The range is the difference between the maximum value and the minimum value. 


A boxplot is a diagram for representing a batch of data. A typical boxplot is shown 
below. 


_ Qi median Q3 
minimum maximum 
| 


NN MN MA. 


4 _ 4 
25% 25% 25% 25% 


The box extends from the lower quartile to the upper quartile, and a vertical line is 
drawn through the box at the median. The whiskers extend from the ends of the box 
to the minimum and maximum values in the batch of data. 
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Hypothesis testing 


There are three stages involved in a hypothesis test: 

1. Set up the null and alternative hypotheses. 

2. Calculate the test statistic. 

3. Report conclusions. 

The two-sample z-test is a hypothesis test which may be used when a sample of at 
least 25 observations is available from each of two populations. It may be used to 


investigate whether there is a difference between the means of the populations. The 
three stages involved in carrying out the two-sample z-test are outlined below. 


Stage 1: Hypotheses 
Set up the null and alternative hypotheses: 


Ao: 14 = Hp, 
Ay: fa F bp, 


where {4,4 and fp are the means of populations A and B, respectively. 


Stage 2: The test statistic 


Calculate the test statistic 
TA— LB 
Z= —_,, 


ESE 


where 


2 2 
ESE =,/-4+4 “8, 
NA NB 


&, and Xp are the sample means, s,4 and sp are the sample standard deviations, and 
na and np are the sizes of the samples from A and B, respectively. 


Stage 3: Conclusions 


© If z<-—1.96 or z > 1.96, then Ho is rejected at the 5% significance level in 
favour of the alternative hypothesis. 


© If -1.96 < z < 1.96, then Hp is not rejected at the 5% significance level. 


The conclusion should be expressed in terms of the hypothesis being tested. 


The quantity ESE in the test statistic for the two-sample z-test is the estimated 
standard error of the difference between two sample means; that is, it is the 
estimated value of the standard deviation of the sampling distribution of the 
difference between two sample means. 


Fitting lines to data 


The least squares fit line for a set of data points is the line that minimises the 
sum of the squared residuals for the data set. It is also known as the regression 
line of y on x. It may be used to predict values of y, the dependent variable, for 
values of x, the explanatory variable, but not vice versa. It should be used only to 
predict y-values for x-values that are within, or just outside, the range of values of x 
represented in the data. 
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